(57) Abstract:

PROBLEM TO BE SOLVED: To reduce the number of SIMD-type
instructions without impairing the variety of arithmetic
functions.

SOLUTION: An instruction decoder 5 determines the data type
from a load instruction and stores that data type in the
data-type storage regions 3a to 3c corresponding to the
registers 2a to 2c that hold the data. A SIMD computing element 4
determines the type of the data to be operated on from the
data types stored in the data-type storage regions 3a to 3c.
* NOTICES *

JPO and NCIPI are not responsible for any
damages caused by the use of this translation.

1. This document has been translated by computer, so the translation may not reflect the original precisely.

2. **** shows a word which cannot be translated.
3. Words in the drawings are not translated.



CLAIMS 



[Claim(s)] 

[Claim 1] A processor characterized by having: a data memory which stores data; an input register into
which the data specified by a load instruction are loaded from said data memory; a data-type specification
means which specifies the data type of the data specified by said load instruction; a data-type storage
region which is associated with said input register and stores the data type specified by said data-type
specification means; and a computing element which operates on the data loaded into said input register
based on the data type stored in said data-type storage region.

[Claim 2] A processor characterized by having: a data memory which stores data; an input register into
which the data specified by a load instruction are loaded from said data memory; a data-type specification
means which specifies the data type of the data specified by said load instruction; a 1st data-type storage
region which is associated with said input register and stores the data type specified by said data-type
specification means; a computing element which operates on the data loaded into said input register based
on the data type stored in said data-type storage region; an output register which stores the operation
result produced by said computing element; and a 2nd data-type storage region which is associated with
said output register and stores the data type corresponding to said operation result.

[Claim 3] A processor characterized by having: an input register which stores data of differing data
types; an operation means which operates on said data of differing data types; and an output register
which stores the result of the operation by said operation means.

[Claim 4] A data-type specification method characterized by specifying the data type of a SIMD-type
instruction by a load instruction.

[Claim 5] The data-type specification method according to claim 4, characterized in that said load
instruction is provided for each data type.

[Claim 6] A data load method characterized by having: a step of loading the data specified by a load
instruction into a register; and a step of storing the data type corresponding to said loaded data in
association with said register.

[Claim 7] An operation method characterized by having: a step of loading the data specified by a load
instruction into an input register; a step of storing the data type corresponding to said loaded data in
association with said input register; a step of inputting the data loaded into said input register,
together with said data type, into a computing element; and a step of operating on said data based on said
data type.

[Claim 8] A data-type update method characterized by having: a step of loading the data specified by a
load instruction into an input register; a step of storing the data type corresponding to said loaded data
in association with said input register; a step of inputting the data loaded into said input register,
together with said data type, into a computing element; a step of operating on said data based on said
data type; a step of storing the operation result of said data in an output register; and a step of
storing the data type corresponding to the operation result of said data in association with said output
register.

[Claim 9] An instruction program characterized by causing a computer to execute: a step of specifying the
data type of a SIMD-type instruction by a load instruction; a step of loading the data specified by said
load instruction into a register; and a step of storing the data type specified by said load instruction
in association with said register.

[Claim 10] An instruction program characterized by causing a computer to execute: a step of specifying the
data type of a SIMD-type instruction by a load instruction; a step of loading the data specified by said
load instruction into a register; a step of storing the data type specified by said load instruction in
association with said register; a step of inputting the data loaded into said register, together with said
data type, into a computing element; and a step of causing said computing element to perform an operation
based on said data type.

[Claim 11] An instruction program characterized by causing a computer to execute: a step of specifying the
data type of a SIMD-type instruction by a load instruction; a step of loading the data specified by said
load instruction into an input register; a step of storing the data type specified by said load instruction
in association with said input register; a step of inputting the data loaded into said input register,
together with said data type, into a computing element; a step of causing said computing element to
perform an operation based on said data type; a step of storing said operation result in an output
register; and a step of storing the data type corresponding to said operation result in association with
said output register.



[Translation done.] 
DETAILED DESCRIPTION



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to a processor, a data-type specification method, a data
load method, an operation method, a data-type update method, and an instruction program, and is
particularly suitable for application to a SIMD (Single Instruction Multiple Data) type processor.
[0002] 

[Description of the Prior Art] Some conventional processors adopt a SIMD-type data format in order to
raise data-processing capacity. Such a SIMD-type processor provides SIMD-type operation instructions,
each of which processes two or more data with one instruction, and a SIMD-type parallel computing element
for executing these instructions.
[0003] Examples of such SIMD-type processors include the Pentium (trademark) processor of Intel
Corporation of the U.S., which adopted MMX technology, and the UltraSPARC processor of Sun Microsystems
of the U.S., which has the VIS instruction set. Drawing 6 shows the data types of conventional MMX
technology. In Drawing 6, four data formats are provided for storage in a 64-bit MMX register: the packed
byte (eight 8-bit data) of Drawing 6 (a), the packed word (four 16-bit data) of Drawing 6 (b), the packed
doubleword (two 32-bit data) of Drawing 6 (c), and the quadword (one 64-bit datum) of Drawing 6 (d).

[0004] Thus, in a SIMD-type processor, two or more data formats can be stored in a register. For this
reason, a SIMD-type instruction is provided for each data type in order to perform SIMD operations on
such data. For example, in MMX technology, an add instruction is provided for each of three data types:
the packed byte of Drawing 6 (a), the packed word of Drawing 6 (b), and the packed doubleword of
Drawing 6 (c). Moreover, when addition with saturation, which prevents underflow and overflow of the
operation result, is included, seven instructions are provided for addition alone. Furthermore,
instructions for subtraction, multiplication, shifts, and so on are likewise provided for each data type.
[0005] For this reason, a conventional SIMD-type processor defines a large number of instructions in
order to execute SIMD-type operations.
[0006] 

[Problem(s) to be Solved by the Invention] However, as the number of instructions increases, more bits
must be assigned to the opcode that distinguishes these instructions, so the instruction code as a whole
grows and the code efficiency of a program falls. Drawing 7 shows an example of a conventional
instruction code.

[0007] In Drawing 7, the instruction code consists of 32 bits and specifies an opcode OP, an output
register rt, and input registers rs and rd. Here, if the number of instructions increases, more bits are
needed for the opcode OP, and the instruction code as a whole grows.
[0008] One way to prevent the decline in code efficiency is to offset the increase in the number of bits
of the opcode OP by reducing the number of bits assigned to the output register rt and the input registers
rs and rd, but this approach restricts the freedom of register selection. The purpose of this invention
is therefore to offer a processor, a data-type specification method, a data load method, an operation
method, a data-type update method, and an instruction program which can reduce the number of SIMD-type
instructions without impairing the versatility of the arithmetic functions.
[0009] 

[Means for Solving the Problem] In order to solve the problem described above, the processor according to
claim 1 is characterized by having: a data memory which stores data; an input register into which the data
specified by a load instruction are loaded from said data memory; a data-type specification means which
specifies the data type of the data specified by said load instruction; a data-type storage region which
is associated with said input register and stores the data type specified by said data-type specification
means; and a computing element which operates on the data loaded into said input register based on the
data type stored in said data-type storage region.
[0010] Thereby, the data type can be distinguished merely by providing a load instruction for each data
type, and an instruction for each data type no longer needs to be provided for every kind of operation,
so the number of SIMD-type instructions can be reduced. Furthermore, the computing element can determine
the data type of the data to be operated on simply by referring to the data-type storage region, and can
perform the operation appropriate to the data type without being given a different instruction for each
data type, so a variety of operations can be performed with a small number of instructions.
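
The effect described in paragraph [0010] can be illustrated with a rough count. The following is only a sketch under simplifying assumptions that are not in the specification: every operation is assumed to exist for every data type in the conventional scheme, a single load instruction is assumed there, and the function names and the figures 8 and 4 are hypothetical.

```python
def opcode_bits(n_instructions: int) -> int:
    """Minimum number of opcode bits that can distinguish n instructions."""
    return max(1, (n_instructions - 1).bit_length())


def per_type_count(n_ops: int, n_types: int, n_loads: int = 1) -> int:
    """Conventional scheme (assumed): one operation instruction per
    (operation, data type) pair, plus n_loads load instructions."""
    return n_ops * n_types + n_loads


def tagged_count(n_ops: int, n_types: int) -> int:
    """Scheme of paragraph [0010]: one load instruction per data type,
    and one operation instruction per kind of operation."""
    return n_ops + n_types


# Hypothetical figures: 8 kinds of operation, 4 data types.
conventional = per_type_count(8, 4)   # 8 * 4 + 1 = 33 instructions
tagged = tagged_count(8, 4)           # 8 + 4 = 12 instructions
print(conventional, opcode_bits(conventional))
print(tagged, opcode_bits(tagged))
```

With these assumed figures the opcode field shrinks from 6 bits to 4 bits, which leaves more of a fixed-width instruction code for register fields.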

[0011] Moreover, the processor according to claim 2 is characterized by having: a data memory which stores
data; an input register into which the data specified by a load instruction are loaded from said data
memory; a data-type specification means which specifies the data type of the data specified by said load
instruction; a 1st data-type storage region which is associated with said input register and stores the
data type specified by said data-type specification means; a computing element which operates on the data
loaded into said input register based on the data type stored in said data-type storage region; an output
register which stores the operation result produced by said computing element; and a 2nd data-type
storage region which is associated with said output register and stores the data type corresponding to
said operation result.
[0012] This makes it possible to update the data type according to the operation result, so that
consistency of the data type is maintained even when the data type changes through an operation, and the
data type need not be respecified each time an operation instruction is executed. Moreover, the processor
according to claim 3 is characterized by having: an input register which stores data of differing data
types; an operation means which operates on said data of differing data types; and an output register
which stores the result of the operation by said operation means.
[0013] This makes it possible to define an operation based on a combination of data types, and to
increase the variety of SIMD operations without increasing the number of instructions. Moreover, the
data-type specification method according to claim 4 is characterized by specifying the data type of a
SIMD-type instruction by a load instruction.
[0014] Since the data type can thus be distinguished when data are loaded into a register, and an
instruction for each data type no longer needs to be provided for every kind of operation, the number of
SIMD-type instructions can be reduced. Moreover, the data-type specification method according to claim 5
is characterized in that said load instruction is provided for each data type.

[0015] Thereby, the data type can be distinguished without providing an instruction for each data type
for every kind of operation, and the number of SIMD-type instructions can be reduced. Moreover, the data
load method according to claim 6 is characterized by having: a step of loading the data specified by a
load instruction into a register; and a step of storing the data type corresponding to said loaded data
in association with said register.

[0016] Thus, when the data loaded into the register are taken in, their data type can be taken in at the
same time, and the operation appropriate to the data type can be performed merely by giving a single
instruction, so the number of SIMD-type instructions can be reduced. Moreover, the operation method
according to claim 7 is characterized by having: a step of loading the data specified by a load
instruction into an input register; a step of storing the data type corresponding to said loaded data in
association with said input register; a step of inputting the data loaded into said input register,
together with said data type, into a computing element; and a step of operating on said data based on
said data type.

[0017] Thereby, the data type can be distinguished merely by providing a load instruction for each data
type, so the number of SIMD-type instructions can be reduced; and since the operation appropriate to the
data type can be performed merely by giving a single instruction, a variety of operations can be
performed with a small number of instructions.
[0018] Moreover, the data-type update method according to claim 8 is characterized by having: a step of
loading the data specified by a load instruction into an input register; a step of storing the data type
corresponding to said loaded data in association with said input register; a step of inputting the data
loaded into said input register, together with said data type, into a computing element; a step of
operating on said data based on said data type; a step of storing the operation result of said data in an
output register; and a step of storing the data type corresponding to the operation result of said data
in association with said output register.

[0019] This makes it possible to update the data type according to the operation result, so that
consistency of the data type is maintained even when the data type changes through an operation.
Moreover, the instruction program according to claim 9 is characterized by causing a computer to execute:
a step of specifying the data type of a SIMD-type instruction by a load instruction; a step of loading
the data specified by said load instruction into a register; and a step of storing the data type
specified by said load instruction in association with said register.
[0020] Since the data type can thus be distinguished when data are loaded into a register, and an
instruction for each data type no longer needs to be provided for every kind of operation, the number of
SIMD-type instructions can be reduced. Moreover, the instruction program according to claim 10 is
characterized by causing a computer to execute: a step of specifying the data type of a SIMD-type
instruction by a load instruction; a step of loading the data specified by said load instruction into a
register; a step of storing the data type specified by said load instruction in association with said
register; a step of inputting the data loaded into said register, together with said data type, into a
computing element; and a step of causing said computing element to perform an operation based on said
data type.
[0021] Thereby, the data type can be distinguished merely by providing a load instruction for each data
type, so the number of SIMD-type instructions can be reduced; and since the operation appropriate to the
data type can be performed merely by giving a single instruction, a variety of operations can be
performed with a small number of instructions.
[0022] Moreover, the instruction program according to claim 11 is characterized by causing a computer to
execute: a step of specifying the data type of a SIMD-type instruction by a load instruction; a step of
loading the data specified by said load instruction into an input register; a step of storing the data
type specified by said load instruction in association with said input register; a step of inputting the
data loaded into said input register, together with said data type, into a computing element; a step of
causing said computing element to perform an operation based on said data type; a step of storing said
operation result in an output register; and a step of storing the data type corresponding to said
operation result in association with said output register.

[0023] This makes it possible to update the data type according to the operation result, so that
consistency of the data type is maintained even when the data type changes through an operation.

[0024] 

[Embodiment of the Invention] Hereafter, a processor according to an embodiment of this invention is
explained with reference to the drawings. Drawing 1 is a block diagram showing the outline configuration
of the processor according to the 1st embodiment of this invention.

[0025] In Drawing 1, data memory 1 stores data; according to a load instruction, data are loaded from it
into a register file 2, and data from the register file 2 can also be stored in it. The register file 2
contains registers 2a-2c and a data-type storage region 3. The data-type storage region 3 is provided for
each of the registers 2a-2c, as data-type storage regions 3a-3c corresponding to these registers 2a-2c.

[0026] Drawing 2 shows an example configuration of the registers 2a-2c according to the 1st embodiment of
this invention. In Drawing 2, each of the registers 2a-2c has a 32-bit data storage area 11, to which a
2-bit data-type storage region 12 is added.
[0027] Here, four data types are provided: four 8-bit packed data, two 16-bit packed data, 32-bit data,
and floating-point format data. As identification data for identifying the data type, "00" is assigned to
four 8-bit packed data, "01" to two 16-bit packed data, "10" to 32-bit data, and "11" to floating-point
format data.

[0028] When four 8-bit packed data are stored in the data storage area 11, "00" is stored in the
data-type storage region 12 corresponding to that data storage area 11. When two 16-bit packed data are
stored in the data storage area 11, "01" is stored in the corresponding data-type storage region 12. When
32-bit data are stored in the data storage area 11, "10" is stored in the corresponding data-type storage
region 12. When floating-point format data are stored in the data storage area 11, "11" is stored in the
data-type storage region 12 corresponding to that data storage area 11.
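
The register layout of paragraphs [0026] to [0028] can be modelled as follows. This is a minimal sketch, not the patented circuit: the 2-bit codes are those given in the text, while the names `DataType`, `make_register`, and `split_register`, and the choice to place the tag above the 32-bit data field, are assumptions made for illustration.

```python
from enum import IntEnum
from typing import Tuple


class DataType(IntEnum):
    """2-bit identification data from paragraph [0027]."""
    PACKED_4X8 = 0b00    # four 8-bit packed data
    PACKED_2X16 = 0b01   # two 16-bit packed data
    WORD_32 = 0b10       # 32-bit data
    FLOAT = 0b11         # floating-point format data


DATA_MASK = 0xFFFFFFFF   # 32-bit data storage area 11


def make_register(data: int, dtype: DataType) -> int:
    """Combine the 32-bit data field and the 2-bit tag into one value.
    Putting the tag in bits 32-33 is an assumption of this sketch."""
    return (int(dtype) << 32) | (data & DATA_MASK)


def split_register(reg: int) -> Tuple[int, DataType]:
    """Recover (data, data type) from a combined register value."""
    return reg & DATA_MASK, DataType((reg >> 32) & 0b11)


reg = make_register(0x11223344, DataType.PACKED_2X16)
print(split_register(reg))
```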

[0029] The SIMD computing element 4 is connected to the register file 2, the data-type storage region 3,
and the instruction decoder 5. It reads the data stored in the register file 2 together with the data
type stored in the data-type storage region 3, and performs the operation appropriate to that data type.
It then outputs the operation result to the register file 2 and outputs the data type corresponding to
the operation result to the data-type storage region 3.

[0030] The instruction decoder 5 generates the control signals which control the register file 2, the
data-type storage region 3, and the SIMD computing element 4, based on the instruction code read from
program memory 6. Program memory 6 stores the instruction code and outputs it to the instruction
decoder 5. Here, a load instruction is provided for each data type.

[0031] Drawing 3 shows an example of the instructions according to the 1st embodiment of this invention.
In Drawing 3, four load instructions, LDPB, LDPW, LDDW, and LDFS, are provided, corresponding
respectively to the four 8-bit packed data, two 16-bit packed data, 32-bit data, and floating-point
format data of Drawing 2. Operation instructions such as addition and subtraction, on the other hand, are
provided in common for the different data types.

[0032] Next, the operation of the processor of Drawing 1 is explained. When a load instruction is read
from program memory 6 (1), the instruction decoder 5 decodes the load instruction. Data are then read
from data memory 1 (2) and stored in the register among registers 2a-2c specified by the load instruction
(3). Here, a load instruction is provided for each data type, and the instruction decoder 5 determines
the data type from the load instruction. The data type is then stored in the data-type storage region
among 3a-3c corresponding to the register in which the data were stored (4).

[0033] For example, suppose the mnemonic of the load instruction is LDPB of Drawing 3 and this load
instruction specifies register 2a in the register file 2 as the input register. By decoding the mnemonic
LDPB, the instruction decoder 5 determines that the data to be loaded are the four 8-bit packed data of
Drawing 2, and obtains "00" as the identification data of that data type.
[0034] The instruction decoder 5 then causes these four 8-bit packed data to be read from data memory 1
and stored in register 2a of the register file 2, and causes "00" to be stored in the data-type storage
region 3a provided for register 2a. Thus the data type of the data stored in registers 2a-2c can be
distinguished merely by providing a load instruction for each data type.
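
The load sequence of paragraphs [0032] to [0034] might be sketched as below. The four mnemonics and their 2-bit codes come from the text; the `RegisterFile` class, the `execute_load` function, the dictionary-based memory, and the address 0x100 are hypothetical stand-ins for the hardware.

```python
# Load mnemonic -> 2-bit data-type identification data (Drawing 3, [0027]).
LOAD_TAGS = {"LDPB": 0b00, "LDPW": 0b01, "LDDW": 0b10, "LDFS": 0b11}


class RegisterFile:
    """Registers 2a-2c with their data-type storage regions 3a-3c."""

    def __init__(self, names=("2a", "2b", "2c")):
        self.data = {name: 0 for name in names}   # 32-bit data fields
        self.tags = {name: 0 for name in names}   # 2-bit data-type fields


def execute_load(regs, memory, mnemonic, reg, address):
    """Decode a load: the mnemonic alone determines the data type."""
    tag = LOAD_TAGS[mnemonic]                        # decoder derives the type
    regs.data[reg] = memory[address] & 0xFFFFFFFF    # data into the register
    regs.tags[reg] = tag                             # type into its tag region


memory = {0x100: 0x01020304}          # hypothetical data memory contents
regs = RegisterFile()
execute_load(regs, memory, "LDPB", "2a", 0x100)
print(hex(regs.data["2a"]), format(regs.tags["2a"], "02b"))
```

The same 32-bit word loaded with LDPW instead would leave identical data in the register but the tag "01", so later operation instructions would treat it as two 16-bit lanes.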
[0035] Next, when an operation instruction is read from program memory 6 (5), the instruction decoder 5
decodes the operation instruction and outputs a control signal to the SIMD computing element 4 (6). When
the control signal is output from the instruction decoder 5, the SIMD computing element 4 takes in data
from the specified registers among 2a-2c (7) and takes in the data type from the data-type storage
regions among 3a-3c corresponding to those registers (8). It then performs the SIMD operation on the data
according to the data type, stores the operation result in the specified register among 2a-2c (9), and
stores the data type in the data-type storage region among 3a-3c corresponding to the output register
(10).

[0036] For example, suppose an add instruction is read from program memory 6, and this add instruction
specifies registers 2a and 2b in the register file 2 as input registers and register 2c as the output
register. In this case, the instruction decoder 5 instructs the SIMD computing element 4 to perform
addition, and specifies registers 2a and 2b as input registers and register 2c as the output register.

[0037] When registers 2a and 2b are specified as input registers, the SIMD computing element 4 reads the
data from registers 2a and 2b and reads the data types from the corresponding data-type storage regions
3a and 3b. The SIMD computing element 4 then adds the data of registers 2a and 2b according to this data
type, outputs the addition result to register 2c, and outputs the data type to the data-type storage
region 3c corresponding to register 2c.

[0038] Here, operation instructions are provided in common for the data types. When an operation
instruction is sent from program memory 6, the instruction decoder 5 sends the SIMD computing element 4
only the kind of operation, such as addition or multiplication, and sends the register file 2 a signal
designating the input registers and the output register. The SIMD computing element 4 determines the data
type of the data to be operated on from the data stored in the data-type storage regions 3a-3c, not from
the operation directions given by the instruction decoder 5.

[0039] This makes it unnecessary to provide an instruction for each data type for every kind of
operation, such as addition, subtraction, and multiplication, so the number of SIMD-type instructions can
be reduced. In addition, the data type stored in the data-type storage region 3 may not only be specified
by the load instruction but also be updated based on the result of an operation by the SIMD computing
element 4.
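
A single type-independent add, as described in paragraphs [0035] to [0039], could be modelled as follows. This is a sketch under assumptions the text does not state: lanes wrap around on overflow, both inputs must carry the same tag, and the floating-point type "11" is left out. The names `simd_add` and `execute_add` are hypothetical.

```python
# Lane width in bits for each integer data-type code; "11" (float) is omitted.
LANE_BITS = {0b00: 8, 0b01: 16, 0b10: 32}


def simd_add(a: int, b: int, tag: int) -> int:
    """Lane-wise add of two 32-bit values; the lane width comes from the tag.
    Wraparound on overflow is an assumption of this sketch."""
    if tag not in LANE_BITS:
        raise ValueError("data type not handled in this sketch")
    width = LANE_BITS[tag]
    mask = (1 << width) - 1
    result = 0
    for shift in range(0, 32, width):
        lane = ((a >> shift) & mask) + ((b >> shift) & mask)
        result |= (lane & mask) << shift   # discard the carry out of the lane
    return result


def execute_add(data: dict, tags: dict, rs: str, rd: str, rt: str) -> None:
    """One ADD for all types: the tags, not the opcode, select the behavior."""
    if tags[rs] != tags[rd]:
        raise ValueError("mixed data types are not handled in this sketch")
    data[rt] = simd_add(data[rs], data[rd], tags[rs])
    tags[rt] = tags[rs]                    # output register inherits the type


data = {"2a": 0x01FF0304, "2b": 0x01010101, "2c": 0}
tags = {"2a": 0b00, "2b": 0b00, "2c": 0b10}
execute_add(data, tags, "2a", "2b", "2c")
print(hex(data["2c"]), format(tags["2c"], "02b"))
```

With the tag "00" the byte lane holding 0xFF wraps to 0x00 without disturbing its neighbor, whereas the same bit patterns tagged "10" would be added as one 32-bit value and the carry would propagate.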

[0040] Drawing 4 shows the data-type update method according to the 2nd embodiment of this invention. In
Drawing 4, suppose that two 16-bit packed data a0 and a1 are stored in data storage area 11a with
data-type identification data "01" stored in data-type storage region 12a, that two 16-bit packed data b0
and b1 are stored in data storage area 11b with data-type identification data "01" stored in data-type
storage region 12b, and that these data are multiplied.

[0041] In this case, the multiplication of a0 and b0 generates 32-bit data, and the 32-bit product a0×b0
is stored in data storage area 11c; likewise, the multiplication of a1 and b1 generates 32-bit data, and
the 32-bit product a1×b1 is stored in data storage area 11d. Here, the data stored in data storage areas
11a and 11b are two 16-bit packed data, whereas the data stored in data storage areas 11c and 11d are
32-bit data, so the data types differ before and after the operation.

[0042] For this reason, the SIMD computing element 4 updates the data-type identification data "01"
stored in the data-type storage regions 12a and 12b, and stores the data-type identification data "10",
which corresponds to 32-bit data, in the data-type storage regions 12c and 12d. Thus, even when the data
type changes as a result of an operation, consistency of the data type can be maintained without giving a
different instruction for each data type, and subsequent processing can be performed appropriately simply
by referring to the data-type storage regions 12c and 12d.
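
The widening multiplication of the 2nd embodiment can be sketched as follows. The assumptions here are not from the specification: a0 and b0 are taken to occupy the low-order 16 bits of their registers, the lanes are treated as unsigned, and the function names are hypothetical.

```python
PACKED_2X16 = 0b01   # two 16-bit packed data
WORD_32 = 0b10       # 32-bit data


def multiply_packed16(a: int, b: int):
    """Multiply matching 16-bit lanes; each product is a full 32-bit value.
    Lane 0 in the low half and unsigned lanes are assumptions."""
    a0, a1 = a & 0xFFFF, (a >> 16) & 0xFFFF
    b0, b1 = b & 0xFFFF, (b >> 16) & 0xFFFF
    return a0 * b0, a1 * b1


def execute_multiply(data, tags, src_a, src_b, dst_lo, dst_hi):
    """Store a0*b0 and a1*b1 in two registers and update their tags to "10"."""
    if tags[src_a] != PACKED_2X16 or tags[src_b] != PACKED_2X16:
        raise ValueError("this sketch only handles two 16-bit packed inputs")
    data[dst_lo], data[dst_hi] = multiply_packed16(data[src_a], data[src_b])
    tags[dst_lo] = tags[dst_hi] = WORD_32   # type changed by the operation


data = {"11a": 0x00030002, "11b": 0x00050004, "11c": 0, "11d": 0}
tags = {"11a": PACKED_2X16, "11b": PACKED_2X16, "11c": 0, "11d": 0}
execute_multiply(data, tags, "11a", "11b", "11c", "11d")
print(data["11c"], data["11d"], tags["11c"], tags["11d"])
```

Because the tags of 11c and 11d now read "10", a following common add instruction would treat them as 32-bit data without any type being named in its opcode.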
[0043] Drawing 5 shows the operation method according to the 3rd embodiment of this invention. In
Drawing 5, 32-bit data a are stored in data storage area 21a, and four 8-bit packed data b0, b1, b2, and
b3 are stored in data storage area 21b. Here, addition of the 32-bit data a and the four 8-bit packed
data b0, b1, b2, and b3 is defined as addition of 8 bits of the 32-bit data a to the four 8-bit packed
data b0, b1, b2, and b3, so that these data can be added by the SIMD computing element 4 of Drawing 1.

[0044] When an add instruction is output to the SIMD computing element 4, the SIMD computing element 4
adds the data a of data storage area 21a and the data b0, b1, b2, and b3 of data storage area 21b
according to the above definition. When a+b0, a+b1, a+b2, and a+b3 are obtained as the addition result
for these data types, these addition results are stored in data storage area 21c as four 8-bit packed
data.

[0045] Thereby, the variety of operations can be increased without increasing the number of
instructions, and the code efficiency of a program can be raised.
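
The mixed-type addition of the 3rd embodiment could look like the following sketch. The text says only that 8 bits of the 32-bit data a are added to each packed byte; taking the low-order 8 bits, and wrapping each lane on overflow, are assumptions, as is the function name.

```python
def add_word_to_packed_bytes(a: int, b: int) -> int:
    """Add 8 bits of the 32-bit value a to each of the four bytes of b.
    Assumptions: the low-order 8 bits of a are used, lanes wrap on overflow."""
    a8 = a & 0xFF
    result = 0
    for shift in range(0, 32, 8):
        lane = (a8 + ((b >> shift) & 0xFF)) & 0xFF
        result |= lane << shift
    return result   # stored as four 8-bit packed data, tag "00"


# a = 0x10 in area 21a; b0..b3 = 0x01, 0x02, 0x03, 0x04 in area 21b
print(hex(add_word_to_packed_bytes(0x00000010, 0x04030201)))
```

In the processor this behavior would be selected by the pair of tags ("10", "00") read from the input registers, so the same add instruction covers this combination as well.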

[0046] 

[Effect of the Invention] As explained above, according to this invention the data type can be
distinguished merely by providing a load instruction for each data type, so the number of SIMD-type
instructions can be reduced without impairing the versatility of data processing.



Japanese Patent Laid-Open No. 2003-44272 (P2003-44272A) 


